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1. Introduction 


Traditionally researchers have defined the problem 
of comprehending anaphoric expressions as one of 
determining ie antecedent of an anaphoric expression, 
that is, determining to which word or phrase an ana- 
Recent studies 


phoric expression refers or “points” 
in both artificial intelligence and linguisties have dem- 
onstrated the need for a theory of the comprehension 


ms, a theory that wecounts for 


of anaphoric expre 
ic and semantic effects, as well as 


the cole of syntac 
inferential knowledge in explaining how anaphors are 
understood En this paper a new theory, based on the 
concept of focusing in the discourse, is introduced to 
explain the interpretation of pronouns, 


Before a theory can be given, and before even the 
difficulties in interpreting anaphors? can be discussed, 
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we must first re-consider what an antecedent is. The 
traditional definition encounters difficulty right from 
the start, it is founded on the notion that one word in 
a sentence refers or points back to another word or 
phrase in the (same or another) sentence. But words 
don't refer back to other words {Morgan 1978]; people 
use words to refer to entities in the world. In particu 
lar they use pronouns to refer to entities which have 
already been mentioned in a discourse, Since an ana- 
phorie phrase does not refer to an antecedent, one 
might want to chim that both the antecedent and the 
anaphor co-refer to the same entity, That description 
is adequate for sentence si, 


sl Lthink green apples taste best and sher 
make the best cooking apples too. 
though noi for discourse D1, where there is no ante- 
cedent phrase in the discourse that co-refers with the 
pronoun “they.” 
DI-1 My neighbor has a monster Harley 1200. 
2 They are really huge but gas-efficient 
bik 
Rather than view antecedence as co-reference, one 
might propose that santecedence is a kind of cognitive 
pointing, the kind of pointing that causes “they” and 
“green apples” to pomt (somehow) to the same class 
of entities in one's mind, This proposi is problematic 
for the same reason that co-reference is: people use 
pronouns when there is no other noun phrase in the 
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discourse that points to the right mental entity. In DI, 
“they’’ refers to bikes which are Hurley 1200's as a 
group, while “a monster Harley 1200" mentions only 
some particular Harley 1200. “They” seems 10 be 
able to refer when used with the previously mentioned 
phrase “a monster Harley 1200” without the two 
phrases cither co-referring or co-pointing. 


If an anaphor does not refer to an antecedent 
phrase, and if it need not always co-refer with its ap- 
parent antecedent (as in D1), then anaphor interpreta- 
tion is not simply finding the antecedent. Nevertheless 
the concept of antecedence as pointing back does 
seem to capture some aspect of the comprehension of 
anaphors, for when certain antecedent words are miss- 
ing from a discourse, people often fail to understand 
what is being said. 


Let us define the problem of interpreting and un- 
derstanding an anaphor in the following way. The 
phrase ‘green apples” in sl, when syntactically and 
semantically interpreted, is said to specify a cognitive 
clement in the hearer’s mind. In the computitional 
model of that process, this element is a database item, 
which might be represented by Apples2 in by the sche- 
ma below: 


Phrase76: 
string “green appies” 
context: speaker! think * tastes best 
a] spe 
Specifies: Appies2 
Apples2: 
super-concept: apples 
color: green 
used-for: cooking 


The speaker uses the information in a cognitive repre- 
sentation like Apples2 above to choose the phrase 
“green apples” in sl. The hearer then uses the phrase 
“green apples” plus the syntactic and semantic inter- 
pretation of the rest of the sentence to locate a similar 
cognitive element in his own mind: it may be slightly 
different because the hearer may not associate use in 
cooking with green apples. A cognitive element, such 
as Apples2, is called the specification of “green ap- 
ples.” These elements, present in the memories of 
speaker and hearer, arc of course related to other cog- 
nitive elements in their memories. 


What is the relation of specifications to the real 
world? One might like to claim that a reference rela- 
tion exists between specified cognitive elements and 
objects in the world, but since referring is what people 
do with words, this relation is problematic for cogni- 
tive elements. Insicad, specifications arc said to 
represent the objects referred to; that is, they bear a 
well-structured correspandence to objects in the world, 
Apples2, the specification of “green apples,” repre- 
sents the objects that are green apples. For phrases 
such as “Santa Claus,” where there is no real world 
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object to represent, a specification represents the men- 
tal schema to whieh are attached the properties nor- 
mally associated with this imaginary person. 


The phrase “they” in st also specifies a cognitive 
element, namely the same one that “green apples” 
docs. Since the two bear the same relation to the 
representation Appies2, I say that they “co-specily” 
that memory clement, or alternatively, that the inter- 
pretation of “green apples” is the co-specifier of the 
interpretation of “they.” Co-specification, unlike 
co-reference, allows one to construct abs ct repre- 
sentations and define relationships between them 
which can be studied in a computational framework. 
With co-reference no such use is possible, since the 
object referred to exists in the world and is not availa-* 
ble for examination by computational processes. 


Even if a phrase and a pronoun do not co-specify. 
ihe specification of the phrase may be used to generate 
the specification of a pronoun. For example, in DI 
“they” dacs not co-specify with the apparent unteced- 
ent phrase “a monster Harley 1200," but rather it 
refers to the chiss of Harley 1200°s of which the ap- 
parent antecedent is an instance. Thus anaphor inter- 
pretation is not simply i matter of finding the corre- 
sponding cognitive clement that serves as the specifi- 
cation of the anaphor; some additional process must 
generate a specification for the anaphor from the rela- 
ted phrase ta monster Harley 1200." 


The concepts of specification and co-specification 
capture the “pointing back" quality of antecedence, 
and also permit us to formulate an explanation of ana- 
phor interpretation that avoids the pitfalls of the con- 
cept of antecedence. Anaphor interpretation can be 
studied as a computational process thut uses the al- 
ready existing specification of a phrase to find the 
specification for an anaphor. The process uses a rep- 
resentation of the discourse preceding the anaphor to 


encode the syntactic and semantic relationships in “e@ 


sentence as well as co-specification relationships be? 
tween phrases? ‘These delinitions in themselves do 
not constitute a theory of anaphor interpretation 
They do, however, make possible a succinet statement 
of the problem: how does one determine the specifi- 
cation of a anaphor? Also, since we suspect that the 
specification of an apparent antecedent phrase plays 
some role in choosing an anaphar's specification, we 
may ask, just what is this role? We hope for a direct 
answer lo these questions, but before one can be giv- 
en, let us consider how a theory of interpretation 
ought to address these questions. A brief look at the 

Sin the rest of this paper, 1 speak of a phrase cospe 
Cor speealying) with another plirase, when what t really mei 
the relution is between representations of phrises thal h 
interpreted by some painy pi y which ides the sentence 
syntactic relations, and by a semantic interpretahon process, which 
compulit: 
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fifficulties of choosing co-speeiliers suggests which 
ssues our theory should cover. 

Determining the co-specifier of un anaphor is diffi- 
cult because there are i multitude of possible co- 
specifiers in a given discourse. and (here is no simple 
way lo choose the correct one. Yet human hearers 
and readers generally do recover the correct CO- 
specifying phrase intended by the speaker. Human 
readers and hearers also fail to recover the co- 
specifying phrase in certain situations; this behavior 
just as valuable an observation as garden path phe- 
nomena for theories of parsing. A theory of interpre- 
tation must predict the pattern of the hearer's and 
reader's correct and incorrect choices well as fail- 
ures to understand, by a rule-poverned account In 
addition, a Gixonomy of the cases in which specifica- 
tions are used to generate other specifications must be 
given, as well as a means of predicting their distribu- 
lion. 


Orvo other aspects of communication make it diffi- 
cult (o find co-specifiers: the context of discussion and 
the inferences people make. People use the context 
surrounding an anaphor in understanding it I a theo- 
ry of anaphor interpretation is to capture understand- 
ing, it must include a means of encoding discourse 
context and whatever structure it has; the context 
must be distilled into a form that preserves ils richness 
without adding overwhelming complexity to the inter- 
pretation process. In addition, researchers have dis- 
covered that anaphor interpretation involves making 
inferences, some of which can be complex, cach of 
which must be chosen from a large base of knowledge 
about objects, people and things. The practical de- 
ployment of inferential capabilities Tor any task re- 
quires control: knowing what lo infer when, and 
knowing when to stop, Since the general control 
problem is poorly understood, a theory of anaphor 
Werpretation must provide solutions to the mare spe- 

ic problem of controlling inference in anaphor inter- 
pretation, 


The major portion of this paper addresses the is- 
sues of determining the specifications of anaphors, 
with an emphasis on the role of context and inference. 
The theory of focusing on parts of the discourse is 
introduced, and a processor and its rules whieh rely on 
the focusing theory are discussed, Before turning to 
this discussion, the previous and extensive research on 
anaphora are reviewed. 


2. Research on Anaphora 


The role of context and inference, as well as syntax 
and semantics, on anaphor interpretation have been 
explored extensively. AT briel look it these explora 
tions indicates the necessity of a new approach, Re 
search on anaphora falls into four broad categories: 
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1. General heuristics for finding antecedents 

[Winograd 1972] 

Syntactic and semantic constraints on anaphora 

[Katz & Fodor 1963, Woods et al. 1976, Chomsky 

1976, Lasnik 1976, Reinhart 1976, Walker 1976] 

3. Use of inference to find antecedents {Charniak 
1972, Rieger 1974, Hobbs 1976] 

4. Analysis of relations among objects in a discourse 
context [Grosz 1977, Lockman 1978, Reichman 
1978, Webber 1978, Hobbs 1979] 

Rather than review cach approach, I point out the 

contributions of cach type to a theory of pronoun 

interpretation, 


v 


General heuristics, as a means of choosing anteced- 
ents, predict reliably in a large number of typical ex- 
amples. However, no simple characterization Ms the 
wide variety of cases where they fail (sce Winograd 
1972 and Hobbs 1977), furthermore, the heuristic 
approach is nol theoretically grounded and cannot 
offer a unified approach to the phenomena. 


Semantic selectional restrictions, based on the Katz- 
Fodor theory of semantic markers, and used by many 
computational linguists, can reduce the space of possi- 
ble antecedents, but they cannot be used to eliminate 
all possibilities, as the example below illustrates 
(where feeling soft can be said of either a mud pack or 
one’s face): 

s2 Put the mud pack on your face. Notice 
how soft it feels. 
Syntactic restrictions, on logical form [Chomsky 1976] 
and on constituent structure [Lasnik 1976, Reinhart 
1976], stipulate conditions in which a pronoun and a 
noun phrase must have disjoint reference, as shown 
below. 
s3 * Near Dan, he saw a smake. 
s4 * The man whose house he bought went 
gold digging in Alaska, 
These rules, however, do not stipulate the interpreta- 
tion of a pronoun; in a general theory they serve only 
as a filtering condition on the class of possible co- 
specifications. Furthermore, syntactic restrictions 
must also stipulate the disjoint reference conditions on 
reflexive pronouns although no ndequate account of 
these conditions has yet appeared, 


Work by researchers in arti ial intelligence on 
inference led to methods for forward and backward 
chaining of inferences to bind the pronoun, represent- 
ed as a free variable, with some piece of knowledge. 
with this approach, the pronoun’s interpretation was 
the value bound to the free variable. This approach 
revealed that inferences about warld knowledge are 
often needed to interpret pronouns, However, these 
methods failed to control the inference process suffi- 
ciently, Charniak, altempting to resolve this problem, 
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Propased demons that would “wake up" in the appro- 
Pridte situation (that is, processes which could them- 
selves notice when they were to begin processing) 
But a large cache of demons would be required. and 
no assurance could be given that demons would exist 
in every situation. Most significantly, this proposal 
said| nothing about the situation where two or more 
demons might apply (who gets control? how are the 
decisions made?). Furthermore all of the inference- 
based approaches to Pronoun interpretation fail to 
offer any theoretical approach because they rely on a 
simple mechanism, (simple variable binding between 
Pronoun and some other phrase) which docs not apply. 
in ay uses of anaphors, such as DI. 


Discourse approaches vo anaphora include a techni- 
que similar to the inference method; one’ identifies 
sentence pairs and determines their semantic relation- 
ship as one of elaboration, similarity, contrast, paralic! 
Structure; the pronouns are interpreted by variable 
binding between items of the sentence pairs [Hobbs 
1979]. Webber 1978, using a notion of “discourse 
identifications” (that contain certain semantic and 
discourse content) similar to the notion of specific a- 
tion, ktipulates constraints on the representation of 
relations among items mentioned in a discourse. 


Grosz [1977, 1978, 1982] illustrates how parts of a 
speaker's knowledge, relevant to a discourse segment, 
are highlighted via focusing, a process that reflect; 
what a speaker Says and the nature of the knowledy : 
in the space. She shows that the structure of a tasé 
affects what items will be focused on in the discourse. 
Reichman has expanded this paradigm by describing 
context spaces, delincated by their topics. Her analysi ; 
shows that within a context space only certain items 
may be pronominalized. She leaves open the ques- 
tions: What is the recognition procedure for determin- 
ing a context space? How does one identify its topic? 
How does the hearer determine the interpretation of a 
Pronoun, i.e., how does a hearer decide which highly 
focused items act as the co-specification of a pronoun? 
All these approaches support the view that since hear- 
ers do not have Privileged access to a speaker's mind, 
other than through what a speaker says, imposing 
structure on the speaker's discourse will provide a 
framework for establishing the interpretation of pro- 
nouns, 

The remainder of this Paper defines the concept of 
speaker's foci and shows that they can be used to 
choose specifications for personal pronouns. The rules 
for choosing interpretations are stated within a frame- 
work that shows: 

e how to control search in inferring by a new me- 
thod balled constraint checking, 

« how to take advantage of syntactic, semantic 
and discourse constraints on interpretation, 
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+ how to generalize the Ireatment of personal 
pronouns, to serve as a framework for the theo- 
ry of interpretation for all anuphors, 


3. Focus for Pronoun Interpretation 
A simple discourse can be used to illustrate how 
Pronouns indicate what the speaker is tatking about in 
the discourse. In the sample below, the speaker men- 
tions two dogs and tells us something about each, 
D2-1 
2 (The) one is a Poodle: 


l have two dogs, 


3 the other is a cocker spaniel, 

4 The poodle has some weird habits, 

5 He eats plastic Mowers and likes to sleep 
in u paper bag. 

6 It's a real problem keeping him away from 
plastic flowers. 


7 The cocker is pretty normal, 
8 and he's a good watch dog. 


9 like having them as pets. 

The speaker uses the pronouns “he! and “him” to 
indicate that the poodle is the actor of cating flowers 
and the object of an action of Keeping. The speaker 
mentions his second dog with “the other” and then 
uses the pronoun “he” only in predicate adjective and 
predicate nominative constructions without reference 
to any events. Initially the speaker focuses attention 
On both dogs and then turns his attention to each fora 
part of the discourse. The Process by which the 
Speaker uses language to indicate his focus is called 
the focusing process. ‘The items in focus are those that 
are talked about for a Part of the discourse. lems in 
focus can be used in two ways, actor focus and as 
discourse focus, The actor focus is a discourse item 
that is predicated as the ‘pent in some event, so for 
example, the poodle is the agent in cating Towers, 
while the discourse focus is, roughly speaking, an item 
the speaker wishes to make several predications 
about.4 


The focusing process, when explained from the 
viewpoint of a hearer, can be described as a process of 
tracking the speaker's foci as they change over the 
discourse. A hearer does not have privileged informa- 
tion about what is in a particular speaker's head, so 
the hearer must decide what the speaker is talking 
about on the basis of what the speaker uses as initial 
referring expressions and subsequent co-specifying 
anaphoric expressions, The hearer follows a focus and 
checks to see if the anaphoric expressions which the 
speaker uses either co- pecify with the hearer's repre- 
sentation of that focus or specify a representation 


—— 

4 This description is not o debmtion of discourse focus The 
discourse focus is defined by the effects of several focusing algor- 
ith, for detaits, see Sidner 4979 
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related to it The hearer is said to be tracking the 
speaker's focus because she or'he cun determing what 
is being Gilked about only afler the speaker Iras said 
something. Focus, as il is used here, is akin to Grosz’s 
notion [1977] of immediate Tocus. 


A process model of focusing and focus tracking 

consists of three distinct processors. The first chooses 
foci based on what the speaker initially says. The 
algorithm for this choice depends on many phenomena 
= see Sidner 1979 for details. Then an interpreter (the 
pronoun interpreter) uses these foci and a set of cules 
of pronoun interpretation (hereafter “pi-rules") to 
imterpret the anaphoric expressions in the discourse. 
This interpreter, like a human hearer, must keep in 
mind whatever other newly mentioned elements the 
speaker has introduced. since sometimes an anaphor 
may co-specify with one of those instead of the ele- 
ments in focus. A third process updates the foci by 
decisions that depend on anaphoric interpretations 
hosen by the pronoun interpreter. During this last 
hase, the updating process moves one of the foci to a 
new clement of the discourse, if some anaphor co- 
specified with that new clement while no anaphor 
co-specified with the phrase already in focus. The 
three processes taken together sketch a simple mode! 
of focus tracking; the model appears to behave like its 
human counterpart in the way it interprets anaphors 
and in the instances in which it fails to “understand.” 


So far. the process model looks circular, Foci are 
chosen, they are used to determine how anaphors co- 
specify with the foci and then the co-specifiers are 
used to determine what's in focus. In fact, the model 
is not circular, but its steps are cyclie. The processor 
cycles through the three processes for each sentence 
the discourse, This cycling differs from logical circu- 
larity because it depends on new information present- 
ed over time, the time of cach sentence of the dis- 
course. 


For example, reconsider the first few sentences of 
D2, given previously. Initially the speaker mentions 
two dogs, which become the discourse focus, and then 
he extends the discussion of them using ‘one...the 
other” anaphors. The actor focus is the speaker. Us- 
ing a definite noun phrase anaphor “the poodle,” the 
speaker turns his attention to the that dog, and it be- 
comes the discourse focus and the actor focus. Using 
another definite noun phrase anaphor “the cocker,” 
the speaker changes the discourse focus to the cocker. 
The speaker's concluding sentence expands the dis- 
course focus back to both dogs with the use of 
“them.” 


In Sidner 1979 a machine for choosing and updat- 
ing the discourse focus and actor focus is given. The 
machine chooses items in the discourse to serve as 
“expected” foci after the first sentence of a discour 
and uses the interpretation of pronouns as well as 
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hon-pronoun anaphors lo determine whether its ex- 
pected choices were correct. The machine updates the 
discourse and actor focus after cach sentence and 
changes foci when the anaphoric expressions no longer 
co-specify with the item in focus. This simple machine 
tracks the foci from the hearer’s point of view and 
shows how the hearer can recognize foci and changes 
of foci. In the remaining discussion, it is assumed that 
the focus machine can determine a discourse and an 
actor focus. Our attention is directed at the following 
concerns. How can these two foci be used to interpret 
discourse pronouns? What rules are used by the pro- 
noun interpreter? How do these rules make use of the 
constraints on the theory of pronoun interpretation 
discussed earlier in this paper? 


The focusing theory of pronoun interpretation can 
be outlined as follows. At any time in the discourse 
after the first sentence, there are foci that are the 
prime candidates for co-specifying with a pronoun. 
Also available is a list of alternate candidates, called 
potential foci, for each focus. Unless the focus candi- 
date is ruled owt by certain criteria (to be discussed 
below), the pronoun interpreter uses it to determine 
the specification of the pronoun, Most of the discus- 
sion in this paper concerns the criteria that must be 
encoded in the rules for choosing among the foci as 
well as the criteria for rejecting a focus in favor of one 
of the potential foci. 


The proposal made here contains two implicit proc- 
essing assumptions, (1) serial processing, and (2) end- 
of-sentence processing By “serial processing”, | 
mean that the interpreter checks a focus as a candidate 
in interpreting the pronoun, and then if that focus is 
unacceptable, checks alternate candidates in turn. By 
“end-of-sentence processing’, | mean that pronouns 
are not interpreted until the entire sentence has been 
syntactically and semantically interpreted. Both of 
these crileria could be given up without undermining 
the focusing theory. One could envision processing in 
parallel by checking the foci and alternates and then 
determining the pronown’s specification from an order- 
ing of all those candidates that meet the criteria of 
choice. To interpret pronouns before the end of the 
sentence, the interpreter could decide using available 
information and then review its choice as more of the 
sentence is processed. These two implicit processing 
assumptions have been made because they simplify the 
account of focus and because they reflect an imple- 
mented version of one focus system. Further research 
will indicate whether these assumptions are too strong 
— if so, the focusing theory may be re-tested under 
parallel and mid-sentence interpretation assumptions. 


4. Using Focus for Pronoun Interpretation Rules 


The pronouns in the example discourse in the pre- 
ceding section can be interpreted with a simple rule: 
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Ri: Choose the discourse focus as the co- 
specifier of the pronoun. But if the pro- 
noun appears in the agent position of the 
verb case frame structure, then choose the 
actor focus. 


R1 is a naive formulation and fails to predict accu- 
rately over a wide range of phenomena. It can be 
tevised to include criteria from syntax, semantics and 
Pragmatics, as well as criteria about discourse charac- 
teristics. fn the next several pages, | present criteria 
that make use of all four sources of knowledge and 
incorporate them into RI to form a series of pi-rules: 
these rules make use of the discourse and actor foci, 
the potential foci, and the Processing of the focus 
machine. 

Syntax, semantics and Pragmatics can be included 
in an initial way by the modification R2: 

R2: Ifa pronoun appears in a verb case frame 
relation other than agent, choose the dis- 
course focus as co-specifier unless any of 
the syntactic, semantic and pragmatic 
knowledge constraints rule out the choice, 

If the pronoun appears in agent position, 
choose the actor focus as co-specifier in 
the same way. 


Ruling out a co-spei ron the basis of syntactic 
and semantic constraints is accomplished by computing 
the various syntactic relationships and restrictions 
(such as Lasnik's disjoint reference rules) and by use 
of semantic selectional restrictions on case frame cate- 
gories. How one uses general knowledge constraints is 
not so obvious; one wants to avoid the problems of 
contro} on inference mentioned previously. 


The focus rule above, although only a skeleton of 
the Full rules needed, contains the basis for control of 
inferences. Under the method of rule R2, inference is 
no longer used to find a binding for the pronoun act- 
ing as la variable by forward and backward chaining. 
Instead inference is a constraint-checking process. 
The context up to the sentence under interpretation is 
integrated into memory: when a co-specifier is chosen 
for the pronoun, inferring serves only to find a contra- 
diction in the database. The database will also include 
an interpreted form of the sentence with the pronoun, 
with the pronoun replaced by its specification as pre- 
dicted from the phrase in focus. Only when a contra- 
diction is found, is the suggested specification given up 
in favor of another choice. The foci of the discourse 
can be additionally helpful in database inference be- 
cause only that part of the database that concerns 
what is in focus needs to be explored for contradic- 
tions Some research on truth maintenance systems 
[Doyle 1978, MeAllester 1978] has experimented with 
constraint-checking and developed algorithms for effi- 
ciently finding and undoing contradictions; however, 
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no one has considered how to explore only certain 
“sub-scctions" of a database for contradictions, 


On this account of inferring and pronoun interpre! 
(ation, the inference machine must still infer proposi 
tions and reach contradictions, However, this metho 
reduces the search needed to make sense of the ne 
sentence because specifie choices for the Pronouns ard 
given; when a contradiction is reached, a new choice i 
made rather than either blindly binding the pronoun t 
some other object or blindly searching for some other 
Proposition to change, so that the contradiction is 
eliminated, 


When a suggested co-specifier for a pronoun must 
be given up, the pronoun interpreter must use the 
potential foci as possible co-specifiers. However, the 
interpreters actual choices require consideration of 
several matters, and before any rules can be stated, 
several motivating cases must be Presented, 

D3-1 Alfred and Zohar liked to play baseball. E) 
2 They played it everyday after school 
before dinner, 
3 After their game, Alfred and Zohar had 
ice cream cones, 
4 The boys thought they ta 
5 Allred always had the vanilla super scooper, 
6 while Zohar tried the flavor of the day cone. 
7 After the cones had been caten, 
8 the boys went home to study. 
In D3, Alfred and Zohar are the initial actor focus 
while baseball is the initial discourse focus. D3-2 
conlains two pronouns, “they” and “itt which, ac- 
cording to R2, co-specify respectively with Alfred and 
Zohar, and baseball. D3-3 uses “their,” which co- 
specifies with Alfred and Zohar, but is not accounted 
for by rule R2. Furthermore “they” in D3-4 does vD 
co-specify with baseball but with ice cream cones! 
What has happened? 


ted really good. 


Suppose in addition to the ongoing actor and dis- 
course focus, the hearer can consider temporarily any 
new entities mentioned in the last sentence that the 
hearer has heard. ‘These entities, the potential foci. are 
dropped out of Processing use if the speaker fails to 
mention them a second time: such alternatives are 
elements the speaker may want to say more about, but 
their importance to the speaker cannot be determined 
from one sentence. ‘Thus in 93-2, foci include the 
event of playing, and the times “everyday”, “before 
dinner,” and “after school", None of these potentials 
survives long into the discourse as D3-3 fails 10 pick 
up on any of them. D3-3 in turn introduces some 
other potential foci, and one of them, ice cream cones, 
is discussed in 3-4: notice alsa that D3-4 fails to 
mention baseball in any way 
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R2 must be extended so trat it uses the potential 
foci whenever eriteria from syntax, semantics or prag- 
matic knowledge rule out the current actor or dis- 
course focus. ‘Thus since “they” in D3-4 cannot co- 
specify with baseball (on both syntactic® and semantic® 
trounds), a potential focus is chosen, Potential foci 
are ordered according to their thematic relation’ in a 
sentence, with semantic case objects first, and agents 
last. The first potential focus Hat meets all the const- 
raints is chosen as the co-speeifier; in D3, ice cream 
cones is the first acceptable potential focus that meets 
all the necessary constraints as a co-specifier for 
“they.” 


Use of this modified pronoun rule follows hand-in- 
hand with the focus machine discussed previously. 
The machine updates its discourse model after cach 
sentence by tracking, among other things, pronoun 
use, When a pronoun is used to co-specily with a 
phrase that is a potential focus, and no phrase co- 
specifies the current focus, either the discourse focus 
or actor focus moves to that potential focus. Which of 
the two foci moves depends on whether the pronoun 
fills the agent case in the verb frame, and in the case 
of multiple agent cases, whether the ongoing actor 
focus is re-mentioned. For example, after D3-4, the 
discourse focus changes to ice cream cones because 
“they co-specifies with the ice cream cones; the boys 
remain the actor focus, since “they” is not an agent 
case for “taste” and since the boys were already the 
actor focus. 


So far pi-rules take into account the movement of 
focus and constraints on syntax, semantics and prag 
matics. However, some of the semantic and pragmatic 
criteria must be expanded beyond the simple case 
frame semantics and representation of noun phrases 
given carlicr. 


As the reader may have concluded, the interaction 
between actor and discourse foci is a complex one; to 
supplement the case Trame semantics, some functional 
notion of theme (in the sense of Cunetionalists such us 
Halliday 1967) is needed. For example, the proper 
co-specifier for the pronoun in 14-3 cannot be chosen 
with R2 and only simple case frame relations 


D4-1 
2 Cart thinks he's studying lor his exams. 


I haven't seen Jeff for several days. 


3 But | think he went to the Cape with Linda. 
Although Carl is the actor locus after 14-2, and “he” 
in D4-3 is the actor case in the embedded sentence, 
the proper choice for the co-specilier is Jeff, Howev- 


x 


since “thev ds a plural pronoun ael aseba is singulair, 

® since the discourse nems Filling Ue objet case of taste 
should be tastable nems, 
This type ol orderne s monvated by the determination of 


an intial focus See Sater 1979 for details 
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er, with simple case frame semantics, the interpreter 
must consider “Carl” as the co-specificr and then fail 
to rule it out (because no syntactic, semantic or 
knowledge constraints can eliminate ww) 


I have designed the pi-rules lo use a semantics that 
marks verbs like “thinking” or “talking” with who the 
thinking or talking was about, D4-3 would indicate 
that the speaker thinks about X, that X went to the 
Cape with Linda. With this semantics, the interpreter 
follows a rule to prefer the discourse focus as the co- 
specifier of any pronoun filling the theme position in 
such a verb. Since the actor focus is associated with 
the agent case, use of such a theme-based rule is con- 
sistent with the framework. With such a rule, Jeff is 
chosen as the co-specifier of “he,” not because Carl 
couldn't have gone to the Cape with Linda, but be- 
cause the speaker is talking about Jeff and his 
thoughts about Jeff, In ence this approach takes 
the discourse focus as primary, the discourse focus 
being what the speaker is talking about so far, while 
the actor focus is the jocus of information about ac- 
tions in the discourse. 


The theme-focus rule does not indicate what to do 
when the discourse focus and actor focus are both 
animate, have the same gender, number and person, 
and are both established during the same sentence of 
the discourse. People sometimes have difficulty 
choosing interpretations in such circumstances. In 
D5-2a below, “he co-specilies with “John” (the actor 
focus) but if D5-2b followed D5-1, “he” may co- 
specify with cither John or Mike (the discourse focus). 


D5-1 John called up Mike yesterday. 
2 a He wanted to discuss his physics homework. 


b He was studying for his driver's test. 


In these cases, native speakers report that the co- 
specifier for the pronoun is ambiguous. If the pro- 
noun fills an agent case, the actor is preferred, but this 
preference is not a strong one. it appears that in such 
cases the ambiguily may not be easily resolved unless 
additional information about the two foci is known 
that stipulates that the sentence is true of only one. 


Poten foci can be sub-categorized as either po- 
tential discourse foci or as potential actor foci. A 
potential actor is a noun phrase which specifies a data- 
base clement marked as animate and which does not 
occur in agent position. In most sentences, the noun 
phrase in agent position contains a descriptive word or 
name in the head, and specifies a database element; it 
becomes the actor focus. But when the noun phrase 
in agent position is a pronoun, it may co-specify with 
either the actor focus (if one exists), or a potential 
actor. Ambiguities occur when an actor and one po- 
tential actor are both present in a previous sentence 
and when the discourse focus is a non-actor entity. 
An example is given below. 
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Consider the examples where s5 follows cach of sh, 
57, s8 and 59. 
s5 He knows a lot about high energy physics: 


s6 Prof. Darby will tell Monty about the 
neutron experiment. 


s7 Prof, Darby will lecture Monty on the 
neutron experiment 


s8 Prof. Darby will help Monty with the 
neutron experiment. T 
s9 Prof. Darby will teach Monty about the 
neutron experiment. 
Some native speakers find all of these sentence pairs 
ambiguous, while some native speakers find only the 
pair s6 followed by s5 ambiguous. These cxamples are 
surprisingly similar to D5. How do some speakers 
decide that “he” co-specifies with Monty or Prof. 
Darby? It appears that they make a comparison and 
choose between the actor focus and the potential actor 
on the basis of evidence for their preferred interpreta- 
tion. When that evidence is not forthcoming, infor- 
mants are confused. Such a behavior suggests that the 
inference mechanism shouid be able to judge prefer- 
ences between a given actor and one potential actor. 


A computational system that makes such judgments 
must have a very rich knowledge base (c.g., to know 
that Monty is a male name, and thal professors may 
be male) and must be able to infer which actor is pref- 
erable fram that base. A computational framework 
for carrying out such subtle judgments is still beyond 
the state of the art, although Marcus [980 has pro- 
Posed a semantic choice mechanism that must also 
weigh evidence for prepositional phrase attachment; 
his parser, when attaching prepositional phrases, asks 
the semantic processor about its preferences for where 
to attach the phrases. 


in summary, the interpreter can use the following 
condition for these ambiguities 


POTENTIAL ACTOR AMBIGUITY CONDI- 

TION: Whenever a pronoun may co-specify the 

actor focus, and a single potential actor exists, 

expect a possible ambiguity. To resolve, 

1. If there is evidence supporting the actor fo- 
cus as the co-specifier, but not the potential 
actor, then the actor focus is the co-spccificr. 


2. When evidence supports the potential actor 
but not the actor focus, select the potential 
actor as the co-spccifier. 

3. However, if there is evidence for both, select 
the actor focus but indicate ambiguity 


A summary of a full set of pronoun rules is piven in 
the appendix to this paper. These rules represent what 
can be said about pronoun interpretation in the ab- 
sence of any additional information in knowledge rep- 
resentation beyond that suggested in the discussion of 
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co-specification. To interpret certain pronouns, su 
as those where a co-specifying phrase does nol prd- 
cede the pronoun in the discourse, as in DI, we mujt 
consider how knowledge is structured and represented 


5. Focus and Knowledge Representation 
Focusing theory with the syntactic and semanti 
representations discussed initially does not include al 
account of the representation of sentence pairs such a! 
D6 where the sentence is ambiguous due to scope ol 
quantifiers. 
D6-1 Wendy showed cach girl Bruce knows a cat. | 
2 a She had found it at Farmer John's. 
b They were all from the same litter. 
At present the focusing rules predict only that “a eat) 
is the expected discourse focus, but they say nothing 
about the scope relations for representing the seman 
tics of the phrase as part of D6-1. Without scope 
relations, D6-2b is ruled out by the number agree 
criteria when in fact most speakers say that "tl 
co-specifies with the set of cats that may be evoked i 
D6-1. Webber 1978 not only argues in detail for rep 
resenting scope relations in the mechanisms for inter 
preting anaphors, but also gives rules for determinin; 
scope in a representation of phrases that may be co 
specifiers. ° 
On Webber's analy: 
senlaltions for “a cat 
knows.” The one for “a cat" 
I. R1: the cat associated with 6-1 such that Wendy, 
showed it to cach girl Bruce knows, 


| 
| 
a D6-1 will have two reprey 
and for “each girl Brued 


can be paraphrased as: 


2. R2: the set of cats, the members of which are assod 
ciated with D6-1 such that Wendy showed (one of) 
them to cach girl Bruce knows. 


Webber has suggested wher the representation must 
be, and what remains to be determined is when it is 
used. Let us suppose that D6-1 is ambiguous.” and 
syntactic or semantic processing of it adjudicates 1 
two readings RI and R2 for “a cat.” When the pro- 
noun interpreter secks to co-specify “it” in D6-2a 
with the discourse focus, both readings will be availa- 
ble. The set reading R2 may be eliminated immediate- 
ly because of constraints that the co-specifier repre- 
sent a singleton, so RE is left. In contrast, for D6-2b 
the reading R1 will be ruled out since “they” requires 
a plural co-specifier. This account both explains how 
the pronouns may be understood, and also is consist- 
ent with Van Lebn's findings [1978] that people do 


# A question, often raised by Martin (see, for example, Martin 
1979}, is whether a sentence that as ambiguous ami several 
7 gs mest be represented by several different structures, ane for 
eh reading, We offers a semantic representation that Preserves 
ambiguity until seme processor demands a refinement Whether 
approach or an aheenative representation comaining both 
readings is best is stil an open question | 
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not normally disambiguate certain ambiguous quantili- 
er scopes during sentence understanding for sentences 
such as D6-1. His compatible with Van Lehn's find- 
ings because alternative interpretations of scope are 
not considered until additional discou il be- 
yond the single sentence is presented. 


Webber's rules also allow the phrase “each girl 
Bruce knows” to be interpreted as a prototype, with 
the result that “a cat" is also. Most speakers Find this 
reading very odd, and one can ask how such a reading 
is eliminated. Possibly it never generated, and 
Webber's rules need modification, Alternatively it is 
produced, and then is ruled out, vither in understand- 
ing D6- because showing prototypic cuts is bizarre, 
or because in interpreting “it for DG-2a, the infer- 
ence mechanism balks at Wendy having found a proto- 
typic cat at Farmer Brown's. 1 think that such read- 
ings never occur in the first place, simply because most 
speakers, who do not report this reading themselves, 

ave difficully understanding the reading when they 


“are told about it. 


Another case of semantic ambiguity, similar to the 
one in D6, is illustrated in D7. 
D7-1 Sally wanted to buy a vegomatic. 
2 She had seen it advertised on TV. 


“A vegomatic™ may be interpreted spe ally, 10 
mean that there is one particular vegomatic or non- 
specifically, to mean that it is one of the many vego- 
matics.” The focusing rules do not distinguish between 
the two after D7-1 because, fike D6-1, D7-1 is ambi- 
guous, and neither interpretation can be chosen with 
certainty. When “it” is resolved for co-specification 
in D7-2, the inference mechanism must decide that 
Sally does not want the very one used in the advertise- 
ment on TY. “A vegomatic.” whether understood as 
specific or non-specific, specifies a different represent- 
ation than “it does. ‘Therefore the pronoun's specifi- 
SutiON as a spec vegomalie must be generated from 
the ambiguous use. 


Suppose, Tor a moment, tiat D7-1 is interpreted so 
ihat a representation that maintains ambiguity is avail- 
able. When the pronoun interpreter processes a sub- 
sequent sentence with a pronoun, it need only rule out 
readings if the inference machine discounts as contra- 
dictory one of the readings (Sally didn’t want to buy 
the very one she saw on TV). If no reading is ruled 
out, the co-specifier would remain ambiguous, so that 
both the indefinite phrase and the pronoun would have 


¥ 


The terms “non-specific and speeda? we traditional 
semantic expressions that bear no reiijion to “speaty and 
vapecification ®© A non-specific reading of “a dog” would be inter- 
preted to produce a representation of an instance of the prototypie 
dog. what is represented is a dog which bas tbe characteristics of 
the prototypic dag = ep. wi al wah tani Jeps, a nil, medium 
size, brown, frenHy. barks, and the hke 
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ambiguous co-specifications. As the next example 
shows, there is some evidence for this behavior- 


Consider the case shown in D8. 
D8-1 Sally bought a vegomatic that had a 
broken cutting blade. 
2 She had seen it advertised on TV, 
“A vegomatic that had a broken cutting blade” in the 
context of D&-1 usually means some particular vego- 
matic that Sally bought. However, “it? is ambiguous 
among the vegomatic Sally bought (the broken one), 
some vegomatic (possibly not broken), and a vegomat- 
ie that is an instance of prototypic vegomatic. Thus 


it” is three ways ambiguous. 


To understand D8-2, the pronoun interpreter does 
nol distinguish among the three readings, since it ac- 
cesses the one provided by the specification in D8-1, 
which is the specific reading. To find the specification 
of "it," the inference mechanism must discover that it 
is slightly odd (1) for Sally to Nave seen the vegomatic 
with a broken blade which she bought being advertised 
on TV, and (2) for Sally to see any broken vegomatics 
on TV, and (3) for Sally to have seen the very one she 
bought on TV. Then if no other choices for co- 
specification are available, the specification of vego- 
matic from D8-1 must be used to generate an appro- 
priate specification for “it”. Since only unbroken ones 
not bought by Sally are appropriate, the specification 
of “it” must be generated using only part of the 
phrase from D8-1. 


This example seems problematic because it places 
much weight on the inference machine to decide that 
the reading is odd. However, this is likely to be just 
where the weight of the decision ought to be; many 
native speakers find D8 slightly bizarre because their 
first reading is that Sally had seen the vegomatic with 
the broken blade advertised on TV. In fact, it appears 
that when a specific indefinite noun phrase such as “a 
vegomatic™ is introduced, and the speaker wants to 
turn atenton to the non-specific reading, a plural 
pronoun is used as shown below: 

slQ She had seen them advertised on TV. 

The plural non-specific reading as in s10 is mentioned 
in the pronoun rules found in the appendix, but the 
generation of a specification as in D&-2 is not. 

Examples such as D8 are perplexing for another 
reason; they are examples of what I call, following 
Fahiman 1977, the “copy phenomenon.” The ambigu- 
ity centers around the fact that there can be many 
copies of an abstract prototype. Automobiles, com- 
puter programs, airplane flights and money are other 
common cases of entities that exhibit the copy phe- 
nomenon. In D9, the interpretation of “it? depends 


on whether the speaker is referring to a particular 
flight or the norma! Sunday flight, a copy of which 
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occurred on “this Sunday" because of the use of 
“usually”. 
D9-1 TWA 384 was so bumpy this Sunday I 
almost got sick. 
2 H usually is a very smooth flight. 

Note th the “it connor to-specify with the partieutir 
flight on “this Sunday. However, it is Possible that 
the speaker used “TWA 384" to refer to a particular 
THght, if this is so, the speaker mixed the specific and 
non-specific interpretations for the co-specifier of 
“it. Just as in D4, a specification for the pronoun 
must be generated — in this case, a non-specific one 
from the specific reading. 


Another characteristic anaphora is the bound varin- 
ble case described: by Partee [1978, 1972|. In DIG 
below! “him” c@-specifies with Archibald, while if the 
reflexive himself” were used, it would involve a vari- 
able bound to the quantifier from “every man.” 
D10-1 Archibald sat down on the floor. 

2 Every man put a screen in front of him. 
In linguistic theory, bound variabies are assumed to be 
represented in sentence semantics; when used in con- 
junction with syntactic disjoint reference rules, pro- 
nouns within the scope of the quantifier can be distin- 
guished from non-scoped ones. Since the pronoun 
interpreter takes account of these conditions, it can 
easily choose a Proper co-specifier for “him” in DIO 
in terms of the focus, but for “himself” it wall recog- 
nize the bound relation to “every man.” tt is crucial 
to these, cases that the representation of the interpreta- 
tion of |a sentence includes scope of quantification, 
especially when the scope is unambiguous. 


6. Focus Restrictions on Co-specification 


There are other restrictions on co-specification that 
result from the Processing of the focus machine. The 
focus machine includes a stack on which old foci are 
Stored when the focus changes. In addition to co- 
specifications with current actor and discourse foci, a 
speaker may use a pronoun to co-specify a discourse 
element that was once in focus but is no longer, Grosz 
1977 described and illustrated this behavior for ana- 
phoric ndun phrases in task-oriented dialogues, 


A pi-rule that selects andidates from the focus 
Slack can capture this behavior. However, the rule 
musi be further constrained. An anaphor that is in- 
tended to} co-specify with a Stacked focus must not be 
acceptable as a co-specifier with cither the foci or 

otential foci. An example from a literary toxt!© tius- 
trates how, 

Was that old lady evil, the one Saul and 1 
had sekn sitting on the porch? I had dreamed 
about her. When the trolley car took me and 
Saul past her house again this morning. she was 
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gone, Evil, it had a queer sound to it in Eng- 
fish. 1 
[Here the narrative moves on to an incident 
in a sehool classroom. A discussion between the 
speaker and a male teacher ensues for five para- 
graphs, Phen the following paragraph bepins: | 
She had worn an okt brown coal and a green 
seart over her bead. 
In this example, “she” co-specifivs with the old lady 
discussed Previously, IE Potok had told of a discussion 
hetween the speaker and a Temale teacher, it would no 
longer be possible to tell that he was co-specilying 
with the off woman, While interpreting the reading of 
“she” as teacher might be a bit surprising because the 
her's clothing was not relevant to the previous 
conversation, no inference can rule out that reading 
because teachers may wear old brown coats, Hence 
the pronoun interpreter must reflect these facts, and it 
does so with the stacked focus constraint, 


The stacked focus constraint is not stated directly 
within the pi-rules. Instead it is implicit in their func- 
tion. The condition is as follows: A pronoun cannot 
be used to co pecify with a stacked focus when a 
current focus is an acceptable Co-specification since 
that current focus will be taken as the interpretation 
and the stacked focus wilt never come into considera- 
tion. The stacked focus constraint is a consequence of 
the movement of focus in the focus machinery. 


The stacked focus constraint, however, may be 
overridden, Grose (sce Deutsch 1975, 1974) has 
identified several examples, involving pumps and boli- 
ing operations, thu show that a pronoun may co- 
specify with the stacked focus even when intervening 
material contains possible co-specifiers. Some compli- 
cated sel of inferences about what can and cannot be 
bolted to what, or what can and cannot be loosened 
might be able to rule aut the intervening possible co- 
specifications. However there Uppears to be too little 
delay in understanding for people who read her ex- 
Cerpts (there is no means of determining whether the 
original speaker and hearer experienced any delay in 
understanding) to Suggest that they are ruling out mul- 
tiple possibiliuies in mierpreung the pronouns.!t 
Hence one may conclude that the Speaker und hearer 
are taking advantage of their knowledge of the task to 
Provide a discourse context in which the focusing ma- 
chinery can be applied, 


How many such discourse interpretuion mecha- 
nisms exist? While this Paper does not address this 
question directly, some specukition is possible on the 


W In the Begimni by Chaim Pook, page 242, chapter 4, 


Fawcett Publications. Ine Conn, 1975, 

H This informal evidence needs to be tested vut in a psycho- 
lical laboratory 1 have nat done so, bur the results of such 
experutmation woul? be feveating 
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as of researeh that has been reported chew here (se 
Sidner 1979, Robinson 1981). tn general, it appears 
that discourses permitti violations of the stacked 
focus constraint must contain an implicit: structure of 
task completion that guides the hearer tow context in 
which the foci for that context may be re-used. With- 
out this structure the hearer cannol decide that the 
speaker intended For the pronoun to co-specily with 
something other than the representation of the object 
currently under discussion, 


hi 


7. Pronouns Which Have No Co-specifiers 


The previous discussion has assumed that a pro- 
noun is always preceded by a co-specifying phrase. 
However, this is not always the case, and a complete 
theory of pronoun interpretation must address cases 
where the co-specifying phrase appears after the pro- 
noun, and where no co-specifier exists, but the discus- 
sion implies a specification for the pronoun. 


Pronouns used with their co-specifiers appearing 
after the occurrence of the pronoun have been called 
backward anaphora in the linguistic literature; 1 refer 
to them as forward co-spectfiers, “Two such examples, 
s11 and s12, are given below. 

sti If he comes before the show, give John 

these Uckets and send Ann to the theatre, 

s12 Near fam. Dan saw a snake 
In general the pronoun co-specilics with some noun 
phrase interpretation, but the phrase is phiced forward 
in the discourse. ‘The types of sentences in which this 
behavior can occur are limited. In gencral it seems to 
be permitted for fronted sentential prepositional phras- 
es (us in $12), subordinate clauses fronted on another 
sentence (as in $11), and for sentences containing 
co-ordinating conjunctions.'? However, il is excluded, 
as far as I can tell, in the Following cases:! 

s13 * 1 heard about her job from Afary. 

sl4 * I spoke about fim with John’s wile. 
Extensive research in linguistics on forward co- 
specifiers (Solun contains a good review) gives reliable 
evidence that structural constraints govern il, in par- 
ticular, syntactic rules can be stated that determine 
when forward eo-spedifiers are not permitted, ‘The 
most recent formulation, by Solan, called the back- 
ward anaphora restriction, Fails on the cases below, 


92 Solan (978 cites the easinple. 
S15 Penelope grabbed his vane ant heat Peter with it 


VY Fhere is some disapri 
acceplahle AH the patiye sp akers 1 ase asked sail they imer- 
preted the pronoun as refering t Mary” ony hecmuse there were 
no ather ussat choices siel because you cbd hear about a persons 
job from that persoa., Since all the people 1 ashesi infoemally told 
me il Was a Strange semence For ah bam assuming that it is 
deviant 
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but some working modification of it may yet be forth- 
coming. 

s16 In her room Mary saw a ghost 

S17 1 heard about Mary's job [rom her. 


Whatever the best formulation of the syntactic 
rules for forward co-specifiers, they are preferable 
only in initial sentences of a discourse. For example, 
when stl occurs in mid-discourse, if a speaker has 
been talking about Henry, and just begun mention of 
Charl native speakers will interpret “he” as co- 
specifying with Henry, or Charles (if Henry can be 
ruled out on basis of some special pragmatic knowl- 
edge). The pi-rules using focus behave in exactly this 
way. They will permit a forward co-specifier only if 
Henry and Charles can be ruled out by syntactic, sc- 
mantic and world knowledge criteria. Were sil to 
occur when Lydia was in focus, the forward co- 
specifier would be possible, and pi-rules mirror this 
behavior. 


When faced with a pronoun that has no preceding 
co-specifier and is not a forward co-specificr, the pro- 
noun interpreter relies on a condition that is called the 
missing co-specifier condition. \n the remainder of this 
section, | describe that condition. The pi-rules include 
a rule for recognizing that a pronoun is missing its 
co-specifier, but they do not offer an interpretation for 
such pronouns. 


In cach of the cases below, the pronoun specifies 
some cognitive clement thal is rehited to one of the 
entities mentioned in dither the previous sentence or 
the sume sentence. These examples! are different 
from the non-specific prototypic readings of pronouns 
discussed earlier because no phrase which can be used 
as a generator exists, 

Dtt-1 J saw Mr. Smith the other day; you know, 
she died last year. 


2 John is an orphan. He mi them very 
much. 

3. Pro-Castra people don't believe Ae is a 
monster, 

4 Twent toa concert last night. They 


played Beethoven's ninth, 


5 I want to mect with Bruce next week. 
Please arrange i for us, 
6 Lused to be' quite a tennis player. Now 
when | get together with the young guys 
to play, | can hardly get fr over the net. 
With the exception of DII-1, most speaker-hearers are 
able to say which is the intended specification of the 
pronoun in the c above. Dil-1 can be understood 


14 The examples given here are From several sources, the first 
re from Postal 1969, the fourth from Chafe 1975, the fifth 
jogues collected Tor cae PAL system Sulner 1979], and the 
spoken by a lecturer at a presestation Jaltended 


three 
from 
last w 
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if the hearer is informed that Mr. Smith had a wife. 
However, some of these, especially | and 2, are so odd 
for most hearers that at first they fai! to comprehend 
the pronoun. Hearers are divided on the acceptability 
of 3, and most hearers find 4 and 5 acceptable. Fur- 
thermore, such examples, as far as I can tell, do not 
occur naturally in written samples. 

This paper does not give an account of just how 
such cases are understood. However, the focusing 
approach provides some basie structure that may be 
useful in generating an explanation for such situations. 
In all the multi-sentence cases, the pronoun specifies 
something which is closely associated with the focus. 
What is problematic is the fact that some speakers 
seem unable to understand a pronoun which specifics a 
database element that, while related to the item in 
focus, represents something that no longer exists, such 
as John’s parents in light of John's orphanhood 
Whatever the manner in which hearers recover specifi- 
cations for such pronouns, some principles are needed 
whith govern why some uses of pronouns are accepta- 
ble and others are not. 


8. The Prablem of Parallelism 


The pi-rules give incorrect predictions for certain 
uses of pronouns, uses that are difficult to define. 
Intuitively, they may be characterized as occurring 
when there is a parallel structure between sentences of 
a discourse.!5 In many of these cases the pi-rules pre- 
dict the wrong co-specifier, ‘To understand whut is 
meant by paralicl structure, two simple cases, one in 
which the pi-rules do predict correctly, and another in 
which they fail, are discussed. In DI2, the pronoun 
co-specifies with the mud pack, as the pi-rules would 
predict, The parallelism of these sentences is reflected 
in the semantics of “put on™ and “pull off" as well as 
in the similarity of the syntactic structure of the two 
sentences, each being in imperative mood. 

D12-1 Put the mud pack on your face. 

2 After 5 minutes, pull it off. 

The pi-rules predict the proper co-specifier in D12 
because the thematic relations of the verb follow the 
similarity of structure. However, in DE3 below, the 
pronoun, “it ce-specifics with rose und not with the 
green Whitierleaf (the pi-rule choice), The initial 
Focus after the first sentence is Whiticrleaf, but the 
parallel syntactic structure of the sentences seems to 
govern a-different choice of co-specifier. ‘To sumer 
rize, between similarity of structure and the pi-rules, 
similarity is preferred as a means of choosing a co- 


'S Hobbs 1979 defines a parallel retation between sentences 
s0 and 51 of a discourse as occurring when propositions PO and PI, 
which fotlow from SQ and SI respectively, have identical pre 
and arguments that are simitar (p 70). This concept of a 
relation seems related to my intuitive choracierization, but as de- 
fined docs not capture the cases } discuss. 
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specifier, so when each gives a different prediction, 

similarity of structure must be used. 

D13-1 The green Whitierleaf is most commonly 
found near the wild rose. 


2 The wild violet is Found near it too. 


At frst glance it appears that the pi-rules could be 
“fixed” by simply observing that the initial focus is 
wrong and that a potential focus should be chosen. 
However, no such option is available, for such a“ 
requires that the inference machine reject the initial 
focus. To do so, the inference mechanism needs some 
knowledge about the world that indicates the unac: 
ceptability, For DI3 no such knowledge could possi- 
bly be forthcoming since all the flora involved are 
found near one another, There is no knowledge to the 
effect that violets are found near wild roses and not 
near Whitierleats, 'e 


Another cxample of parallel structures is shown in 
D14. The parallel structures again are reflected in th 
similarity of the syntactic forms as well as the seman- 
tics of “most” and “mine”. After D14-1, the initial 
focus is the car radiator that is associated with each 
car of “most cars Using the focusing rules, the pro- 
noun interpreter will take “it” to co-specify with that 
radiator, But this prediction is incorrect; “it co- 
specifies with the radiator of the speaker's car. 17 
D14-1 On most cars the radiator has a free bolt hook. 

2 But on mine, it has a foating bolt hook. 


The use of “it” here is similar to the instance of a 
Prototype for “it in D8. The two discourse examples 
differ because D14-2 has an underlying semantic form 
that parallels D1l4-1, D14-1 specifies a universal set 
of cars and says something about one of the parts for 
those cars, DI4-2 specifies a set of one thing, the 
speaker's car, and says something about a part of it: 
the speaker's car is related to the universal car by 
instantiation. Thus “it” in D14-2 is not pointing to 
some instance of the prototypic radiator, it co- 
specifies with the radiator of the speaker's car, but 
“it is related intensionally to the “radiator” in Di4- 

The similarity in the underlying semantics of D14-1 
and D14-2 must be used in interpreting the pronoun 
uses, 


One might wish to construct some special-purpose 
mechanism Chit looks for similarities in structure be- 
tween two sentences, ‘This method is doomed for two 
reasons. Hirst, parallelism ex. in many aspects of 
language, and it happens at arbitrary levels of struc- 


16 tn certain cases a special audience may have afferent 
responses ta the parallelism above For example. botanists who 
know whit [owi are near others might behave differently, But 
even special audiences must sometimes use peneral techniques. 
Such is the s the DII example, because Whitierleats exist only 
in my imagination. 

"Ty thank RC 


Moore Jor suggesting this caample 
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lure rather than just syntactically. Second, al any 
given level, the problem of recognition of parallelism 
has plagued computational models of language since 
such models were first suggested. For example, pars- 
ing of English sentences containing conjunction is as 
yet an unsolved problem. Methods tried, such as 
those of Woods 1973 in LUNAR, fail because of over- 
generalization, Proper computational recognition of 
parallelism is still beyond the state of the art. 


The fact that interpretation of parallelism has failed 
for other aspects of computational models of language 
only indicates that the problem is i deep one, The 
example below is especially surprising because il dem- 
onstrates the use of parallel constructions thal may be 
found between whole paragraphs in a discourse. The 
interpretation of “the schedule,” used anaphorically in 
D15-7, is not ambiguous between “SOL” whieh is a 
kind of sehedule, and the trans sion schedule; read- 
ers recognize that the anaphor is unambiguous presum- 
ably because the process described in lines 5-7 paral- 
@ai: the one in lines 2-4. Whatever the proper account 
of parallelism for pronominal anaphora, it must also be 
generalizable to account for this kind of example. '* 


D15-1 The SOL is searched for an entry for the 
subscriber. 


2 Ione is found, the subscriber's relative 
transmission time is computed according 
to formula- 1. 

3 The subscriber's clock transmission time 
is computed according to formula-2. 

4 When the transmission time has been 
computed, it is inserted as the pri- 
mary entry ina transm n schedule. 

5 For each RATS entry, the RATS's rela- 
tive transmission time is computed 
according to formula-1, 


6 and the RATS's clock transmission time is 
© computed according to formula-2, 
rf 


The RATS transmission times are entered 
into the schedule. 

One possible consequence of these observations 
could be that the focus mechanism should be aban- 
doned in favor of some as yet unspecified mechanism 
that is able to determine parallel relations among sets 
of sentences in a discourse. However, methods for 
interpreting pronouns from parallel sentences and par- 
agraphs offer no constrictive way of interpreting the 
Pronouns in most of the exampl presented in this 


paper. Many cases ol eo-specihcanon occur where 
there ts no similarity of structure other than the com- 


IR For casier understanding, the reader must Know that the 
SOL has both subsensber and RATS catnes Flay cxample comes 
from Balzer el a) 1977, “information imn Program Speeitiesuon,” 
Proceedings of the Fifih Pnternaiiuneal Jom Conference on Arfi tal 
dareliigence, p 394 
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mon subject-verb-object pattern typical of English 
sentences. Since what is being talked about appears in 
many constituent positions in sentences of a discourse, 
the S-V-O pattern is too gross a level to specify simi- 
larity. Hence, while paraliclism is needed to deal with 
a certain set of cases for which the pi-rules predict 
incorrectly, the pi-rules are effective for many other 
cases of co-specification where parallelism would not 
be helpful. One may conclude that focus mechanisms 
account for one aspect of pragmatic anaphora, and 
that some different mechanism is needed to encode 
similarities in structure that sometimes occur in dis- 
course. This paper docs not give an account of such a 
mechanism. Rather, the examples above provide some 
additional observations about the nature of parallelism 
in interpreting pronouns in natural languages. 


9. Conclusions 

Two claims have been substantiated in this paper. 
First, | have shown that focusing is compatible with 
linguistic rules for disjoint reference, semantic selec- 
tional restrictions, and with representations delimiting 
quantifier scope. Furthermore, these sources of infor- 
mation are necessary for the pronoun interpretation 
rules. 


Second, | have demonstrated that focusing helps 
contro! the inference process needed to interpret pro- 
nouns. The pronoun interpreter predicts a co-specifier 
and then asks the inference machine for confirmation, 
when the machine finds a contradiction rather than 
consistency, the focusing process produces a new can- 
didate co-specifier. In effect, the focusing process and 
the inference machine collaborate on pronoun inter- 
pretation. In previous AI natural language systems, 
interpretation resulted from binding of free variables 
when making inference; the inference process could be 
characterized as one of proving a consequent from a 
set of premises. However, because the consequent of 
an inference rule contained free variables, many infer- 
ences had to be drawn and then “undone” during the 
search for a complete chain of inferences. The focus- 
ing approach eliminates this kind of blind binding and 
unbinding as well as shortens the inference chain 
search, 

A part of this paper has described and illustrated 
how the pronoun interpretation rules predict the co- 
specifiers of pronouns with both the actor and the 
discourse focus, and indicated that both actor and 
discourse Toci are necessary. Che rules show how 
many constraints — syntactic,  senwintic, and 
pragmatice! — affect the choice of specification for 
pronouns; they also show how many kinds of informa- 


© Phonological constraints have not been discussed in this 
paper. However, rules for contrastive stress and the like could be 
incorporated in the way that the other three classes are. 
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tion about the world of the spe 
art in distinguishing ihe co-speeifieations of pro- 
nouns. By means of the discourse 


ker and bearer play a 


tnd actor foci, the 


Ai-rules differentially apply the constraints; the predic- 


tons which result are tested in the database repr 


nt- 


ation of the speaker and hearer's world, Focusing 
chptures the effects of foregrounding (cf. Chafe 1975) 
since focusing accounts for the co-specifications of 
pronouns by means of the foci, and focus movement 
indicates how new entities may be foregrounded and 
pronominalized. 


In this paper I have demonstrated the types of ana- 


Dhorie uses that cannot be explained by the focusing 
theory These uses, the focus “popping” cases de 
scribed by Grose, and the Parallelism cases, tlusirate 
that in anaphor interpretation, other kinds of processes 
that dre computationally realizable and controllable 
are needed, Furthermore, an adequate syntactic theo- 
ryj of forward co-specificrs, which remains to be dis- 
covered, must be incorporated in the syntactic criteria 
for the pronoun interpretation rules. 


This paper further specifies the nature of focusing 


as it relates to a theory of pronoun interpretation. A 
focus-based theory with stipulations for syntax, se- 
Mantics and inferential knowledge, provides a pre- 
dictive and explanatory theory of pronoun interpreta- 
lion The theory is predictive because it stipulates 
legal and illegal pronoun uses as well as their interpre- 
tations; the limits of its predictive power have been 
demonstrated as well. The theory is explanatory be- 
cause several observations about anaphora in English, 
all crueial to focusing theory, can account for cach of 
the rules: that pronouns are signals of what is being 


discussed, that whenever the discussion change: 


‘, pro- 


nouns must signal the new element of discussion with- 
out confusion, and that hearers make use of syntactic, 
semantic, and pragmatic knowledge in a controlled 
way, in understanding anaphors. A theory of anaphor 
interpretation different from the focusing theory needs 
either to use these observations in an account of the 
facts or to demonstrate why these observations are not 


relevant to the interpre 


jon process. 


Appendix. Pronoun Interpretation Rules 


The rules in this appendix assume the availability of 


an actor focus, discourse focus, potential actor and 
potential discourse foci. The Pronoun interpreter ap- 
plics these rules in ihe order given, 


Agent position pronouns 


1 
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hen a sentence in which a prononn cecurs is the sec- 
oad sentence of a new discourse, the re acy rule may 
be applied. Recency rule. When a Pronoun is in subject 
position and ts the initial phrase in a sentence, and if a 
member of the potential (discourse or actor) foci occurs: 
as the fast phrase in the Previous sentence, test the pro- 
ndun for co-specifying with that potential focus 
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2. Theme sales Wien the pronoun occurs in an embedded, 

sentence, if the embedded sentence is marked as having 

a theme that is vither the discourse or actor focus, test 

the focus in that theme position ay the c pecifier of the 

Pronoun. 
3. Potential actor ambiguity condition: Whenever a pronoun 
may co-specify the actor focus, and a single potential 
actor exists, expect a possible ambiguity, To resolve. (1) 
if there as evidence supporting the actor focus as the 
co-specificr, but not the potential actor, thea the actor 
focus is the co-specifier, (2) When evidence supports 
the potential aclor but nol the actor Focus, choose the 
polental actor as the co-speedier, (3) However, if there 
ts evidence for hoth, choose the actor focus but indieate 
ambiguiiy. 


We Prononunaliod ator Pawns rube When Ube actor focus 
was fasi mentioned with i pronoun, if the a ent position 
Pronoun is of the same pender and person as the actor 
focus, il must coespeeity with the actor focus Ir it dogs 
not, a potential actor Focus may he chosen, but the pro- 


noun use is odd 

5. Plural rule: Af the pronoun in question ts plural, wif 
the actor focus is singular, test items in the following list 
for co-specification;: a non-specific reading of the focus 
(only fer foci that specify non-human entities), actor 
focus and potential actor foci, all the Potential actor foci 
together, the discourse focus, and the potential discourse 
foci. 

6. Basic rule: “Test the actor faci as a co-specifier with a 
Pronoun in agent position Followed by potential actor 
foci. If these fail, eheek the discourse focus, potential 
discourse foci and actor focus stack. 

7. Closure ride: Should all other rules Isil, if the pronoun 
occurs in an introductary clause, expeel a forward co- 
specifier. IT the pronoun is not in an introductory 
ekuse, an instance of the missing vo-specifier condition 
has occurred 


Non-ogent position pronouns 

L When a sentence in whieh a pronoun occurs is the sece 
ond sentence of a new discourse. the recency rale r 
be applied. Reconey rule: When a pronoun is in subject 
position and is the initial phrase in a sentence, and if 
member of the potential (discourse or actor) foci occurs 
as the fast phrase in the previous sentence, lest the pro- 
noun for cosspecit yun: with that Potential Facies 


2. Mavic rule Cheek the discaurse focus, followed hy po- 
lentil discourse favi, followed hy thepaetor focus. 
3. Plural pronoun rules W the discourse focus is singular 


and the pronoun is Dlural lest ihe Non-specific reading 
(only for foc ihat specily non-human entities} for eo- 
Specification, then potential discourse. foci, followed by 
the actor focus 


d. Fom rekuted tem rule VW some discourse entity has 
been related to the focus during the discourse, test it for 
cosspecifying with the pronoun in question 

S. Focus stack) Check lems in the focus stick for co- 
specifying in last-in first-out order 

6. Closure rule: When all the above rules fail, if the pro- 
noun occurs in an introductory chiuse, expect a forward 
co-specifier, Otherwise the Pronoun is an instance of a 
missing co-specitier. 
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